H gen#meA 

Journals.ASM.org 

Draft Genome Sequence of Rhodococcus erythropolis DN1, a Crude Oil 
Biodegrader 

Alexander Shevtsov, Pavel Tarlykov, Elena Zholdybayeva, Dauren Momynkulov, Ainur Sarsenova, Nazira Moldagulova, 
Kuvat Momynaliev 

National Center for Biotechnology, Astana, Kazakhstan 

We report the 6,548-Mb genome sequence of Rhodococcus erythropolis strain DN1, isolated from the oil-contaminated soil in the 
Karagandy region of Kazakhstan. The draft genome sequence of strain DN1 might provide new insights into the genetic mecha- 
nisms of crude oil biodegradation. 
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^^doGGCdis erythropolis is a member of the aerobic Gram- 
positive bacteria, famous for their ability to catabolize a wide 
range of organic compounds, including alkanes, cycloalkanes, and 
various aromatic hydrocarbons (1-5). R. erythropolis strain DN1 
was isolated from oil- contaminated soil in the Karagandy region 
of central Kazakhstan. Cultivation experiments with 3% crude oil 
solution resulted in the solution being almost entirely (92.7%) 
degraded by DN1 after 30 days of incubation. In addition, DN1 
was able to degrade crude oil in the temperature range of 5°C to 
37°C and in high NaCl concentrations (up to 6.5%). 

The whole-genome shotgun (WGS) sequence data were re- 
trieved with Ion Torrent sequencing technology that gave approx- 
imately 130-fold coverage. A total of 78 contigs, with an N 50 length 
of 224,624 bp and a G+C content of 62.4%, were assembled using 
Newbler software version 2.3 (454 Life Sciences). The genome was 
annotated using the NCBI Prokaryotic Genomes Automatic An- 
notation Pipeline, released in 2013. The draft whole-genome se- 
quence of strain DN1 has 6,548,397 bases that constitute 5,979 
predicted coding sequences (CDSs) and 3 rRNA and 54 tRNA 
genes. The open reading frame (ORF) annotation by Rapid An- 
notations using Subsystems Technology (RAST) (version 4.0) (6) 
has indicated that many proteins are potentially involved in the 
metabolism of different organic compounds, including amino ac- 
ids and derivatives (588 ORFs), carbohydrates (520 ORFs), cofac- 
tors, vitamins, prosthetic groups, and pigments (453 ORFs), and 
fatty acids, lipids, and isoprenoids (386 ORFs); protein metabo- 
lism (237 ORFs); DNA and RNA metabolism (219 ORFs); and the 
metabolism of aromatic hydrocarbons (61 ORFs) and other com- 
pounds. In addition, an analysis of the DN1 CDSs revealed that 
many genes, including those for aliphatic and aromatic dioxyge- 


nases, are involved in the biodegradation of xenobiotics. It is rea- 
sonable to suggest that strain DN1 will be useful in the bioreme- 
diation of regular oil- contaminated soils and oil- contaminated 
soils with high salt concentration. 

Nucleotide sequence accession numbers. The whole -genome 
shotgun project has been deposited at DDBJ/EMBL/GenBank un- 
der the accession no. AUZK00000000. The version described in 
this paper is the first version, AUZK0 1000000. 
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